Biclustering of gene expression data by an extension of mixtures of factor analyzers.
نویسندگان
چکیده
A challenge in microarray data analysis concerns discovering local structures composed by sets of genes that show homogeneous expression patterns across subsets of conditions. We present an extension of the mixture of factor analyzers model (MFA) allowing for simultaneous clustering of genes and conditions. The proposed model is rather flexible since it models the density of high-dimensional data assuming a mixture of Gaussian distributions with a particular omponent-specific covariance structure. Specifically, a binary and row stochastic matrix representing tissue membership is used to cluster tissues (experimental conditions), whereas the traditional mixture approach is used to define the gene clustering. An alternating expectation conditional maximization (AECM) algorithm is proposed for parameter estimation; experiments on simulated and real data show the efficiency of our method as a general approach to biclustering. The Matlab code of the algorithm is available upon request from authors.
منابع مشابه
Mixtures of common t-factor analyzers for clustering high-dimensional microarray data
MOTIVATION Mixtures of factor analyzers enable model-based clustering to be undertaken for high-dimensional microarray data, where the number of observations n is small relative to the number of genes p. Moreover, when the number of clusters is not small, for example, where there are several different types of cancer, there may be the need to reduce further the number of parameters in the speci...
متن کاملبه کارگیری خوشهبندی دوبعدی با روش «زیرماتریسهای با میانگین- درایههای بزرگ» در دادههای بیان ژنی حاصل از ریزآرایههای DNA
Background and Objective: In recent years, DNA microarray technology has become a central tool in genomic research. Using this technology, which made it possible to simultaneously analyze expression levels for thousands of genes under different conditions, massive amounts of information will be obtained. While traditional clustering methods, such as hierarchical and K-means clustering have been...
متن کاملEffect of different concentrations of leukemia inhibitory factor on gene expression of vascular endothelial growth factor-A in trophoblast Tumor Cell Line
Background: Several studies have shown that leukemia inhibitory factor (LIF) is one of the most important cytokinesparticipating in the process of embryo implantation and pregnancy, while, the role of this factor on vascular endothelialfactor-A (VEGF-A), as one of the most important angiogenic factor, has not been fully investigated yet. The aimof this study was to evaluate th...
متن کاملThe Effect of Aerobic Training on Tumor Necrosis Factor alpha, Hypoxia-Inducible Factor-1 alpha & Vascular Endothelial Growth Factor Gene Expression in Cardiac Tissue of Diabetic Rats
Objective: The goal of this research was to determine the influence of 4 weeks aerobic training on gene expression of tumor necrosis factor alpha (TNF-α), hypoxia-inducible factor-1 alpha (HIF-1α) and vascular endothelial growth factor (VEGF) in the cardiac tissue of diabetic rats. Materials and Methods: In an experimental study, 30 male wistar rats were partitioned into three groups (n=10), d...
متن کاملBiclustering Gene Expressions Using Factor Graphs and the Max-Sum Algorithm
Biclustering is an intrinsically challenging and highly complex problem, particularly studied in the biology field, where the goal is to simultaneously cluster genes and samples of an expression data matrix. In this paper we present a novel approach to gene expression biclustering by providing a binary Factor Graph formulation to such problem. In more detail, we reformulate biclustering as a se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- The international journal of biostatistics
دوره 4 1 شماره
صفحات -
تاریخ انتشار 2008